Quickest Detection with Social Learning: Interaction of local and global decision makers
نویسنده
چکیده
We consider how local and global decision policies interact in stopping time problems such as quickest time change detection. Individual agents make myopic local decisions via social learning, that is, each agent records a private observation of a noisy underlying state process, selfishly optimizes its local utility and then broadcasts its local decision. Given these local decisions, how can a global decision maker achieve quickest time change detection when the underlying state changes according to a phase-type distribution? The paper presents four results. First, using Blackwell dominance of measures, it is shown that the optimal cost incurred in social learning based quickest detection is always larger than that of classical quickest detection. Second, it is shown that in general the optimal decision policy for social learning based quickest detection is characterized by multiple thresholds within the space of Bayesian distributions. Third, using lattice programming and stochastic dominance, sufficient conditions are given for the optimal decision policy to consist of a single linear hyperplane, or, more generally, a threshold curve. Estimation of the optimal linear approximation to this threshold curve is formulated as a simulation-based stochastic optimization problem. Finally, the paper shows that in multi-agent sensor management with quickest detection, where each agent views the world according to its prior, the optimal policy has a similar structure to social learning. Index Terms Quickest time Bayesian change detection, social learning, phase-type distribution, stochastic dominance, Blackwell dominance, multi-agent sensor scheduling, partially observed Markov decision process This work was partially supported by NSERC. V. Krishnamurthy is with the Department of Electrical and Computer Engineering, University of British Columbia, Vancouver, V6T 1Z4, Canada. (email: [email protected]).
منابع مشابه
Networks of Biosensors: Decentralized Activation and Social Learning
This paper deals with the dynamics of biosensors and networks of biosensors, where individual biosensors are constructed out of protein molecules. Such biosensors are fully functioning nano-machines. The paper explores methods for decentralized self-activation of networks of biosensors using game-theoretic methods. A global game analysis in terms of a Bayesian game, and a correlated equilibrium...
متن کاملSequential Event Detection Using Multimodal Data in Nonstationary Environments
The problem of sequential detection of anomalies in multimodal data is considered. The objective is to observe physical sensor data from CCTV cameras, and social media data from Twitter and Instagram to detect anomalous behaviors or events. Data from each modality is transformed to discrete time count data by using an artificial neural network to obtain counts of objects in CCTV images and by c...
متن کاملBayesian Sequential Detection With Phase-Distributed Change Time and Nonlinear Penalty—A POMDP Lattice Programming Approach
We show that the optimal decision policy for several types of Bayesian sequential detection problems has a threshold switching curve structure on the space of posterior distributions. This is established by using lattice programming and stochastic orders in a partially observed Markov decision process (POMDP) framework. A stochastic gradient algorithm is presented to estimate the optimal linear...
متن کاملBayesian Sequential Detection with Phase-Distributed Change Time and Nonlinear Penalty -- A POMDP Approach
We show that the optimal decision policy for several types of Bayesian sequential detection problems has a threshold switching curve structure on the space of posterior distributions. This is established by using lattice programming and stochastic orders in a partially observed Markov decision process (POMDP) framework. A stochastic gradient algorithm is presented to estimate the optimal linear...
متن کاملAsymptotically Optimal Quickest Change Detection in Distributed Sensor Systems
In the standard formulation of the quickest change-point detection problem, a sequence of observations, whose distribution changes at some unknown point in time, is available to a decision maker, and the goal is to detect this change as quickly as possible, subject to false alarm constraints. In this paper, we study the quickest change detection problem in the setting where the information avai...
متن کامل